Scalable and Reliable Collaborative Spam Filters: Harnessing the Global Social Email Networks

نویسندگان

  • Joseph S. Kong
  • P. Oscar Boykin
  • Behnam Attaran Rezaei
  • Nima Sarshar
  • Vwani P. Roychowdhury
چکیده

We introduce a collaborative anti-spam system that is based on pervasive global social email networks. Essentially, we provide a solution to this open research problem: given a network of N users who are willing to share information collaboratively (e.g. the digests or ngerprints of known spams), how do we search for each user's content e ciently and reliably in a distributed manner with minimal tra c cost on the network? As a solution to this open problem, our proposed system employs the percolation search process, which makes the tra c generated due to queries for spam digests scale sublinearly as a function of N . However, in order to reap the bene ts of this novel percolation search algorithm, the node degree distribution of the underlying network must be heavy-tailed. Interestingly, latent global social email networks comprising of personal contacts possess a power-law heavy-tailed degree distribution, which renders itself an ideal natural platform to employ the percolation search algorithm. As a result, our proposed distributed spam lter requires no dedicated peer-to-peer (P2P) systems or centralized server-based systems. We have performed large-scale simulations and we nd that the system achieves a spam detection rate close to 100%, while the false positive rate is kept around zero. The bandwidth cost per user as well as the system-wide bandwidth cost are shown to be very low. Electrical Engineering Deptartment, University of California, Los Angeles, CA 90095. y Electrical and Computer Engineering Department, University of Florida, Gainesville, FL 32611

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Social network analysis of web links to eliminate false positives in collaborative anti-spam systems

The performance of today’s email anti-spam systems is primarily measured by the percentage of false positives (non-spam messages detected as spam) rather than by the percentage of false negatives (real spam messages left unblocked). One reliable anti-spam technique is the Universal Resource Locator (URL)-based filter, which is utilized by most collaborative signature-based filters. URL-based fi...

متن کامل

Dynamic configuration and collaborative scheduling in supply chains based on scalable multi-agent architecture

Due to diversified and frequently changing demands from customers, technological advances and global competition, manufacturers rely on collaboration with their business partners to share costs, risks and expertise. How to take advantage of advancement of technologies to effectively support operations and create competitive advantage is critical for manufacturers to survive. To respond to these...

متن کامل

A New Hybrid Approach of K-Nearest Neighbors Algorithm with Particle Swarm Optimization for E-Mail Spam Detection

Emails are one of the fastest economic communications. Increasing email users has caused the increase of spam in recent years. As we know, spam not only damages user’s profits, time-consuming and bandwidth, but also has become as a risk to efficiency, reliability, and security of a network. Spam developers are always trying to find ways to escape the existing filters therefore new filters to de...

متن کامل

Symbiotic filtering for spam email detection

This paper presents a novel spam filtering technique called Symbiotic Filtering (SF) that aggregates distinct local filters from several users to improve the overall performance of spam detection. SF is an hybrid approach combining some features from both Collaborative (CF) and Content-Based Filtering (CBF). It allows for the use of social networks to personalize and tailor the set of filters t...

متن کامل

Optimization of Anti-Spam Systems with Multiobjective Evolutionary Algorithms

In this paper anti-spam filtering is presented as a cumbersome service, as opposed to a software product perspective. The huge human effort for setting up, adaptation, maintenance, and tuning of filters for spam detection in anti-spam systems is explained. Choosing the best importance scores for the spam filters is essential for the accuracy of any rules based anti-spam system, and is also one ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005